255 results found.
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
Hausa Korean Mandarin Chinese Thai Vietnamese
Availability:
From Data Center(s)
License:
ELRA, Appen
Size:
60 GByte Production Status:
Existing-updated
Use:
Multilingual Speech Processing incl. multilingual speech recognition, rapid deployment of speech processing systems, language identification,
-
Paper title:GlobalPhone: Pronunciation Dictionaries in 20 Languages
-
Paper track:Speech
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Tanja Schultz | Karlsruhe Institute of Technology | DE |
| Author 2 | Tim Schlippe | Karlsruhe Institute of Technology | DE |
| Main Contact | Tanja Schultz | Universität Bremen | None |
Documentation:
yes; English; in part in various publications from our group
Written/Multimodal
Ontology,
Language Type:
Multilingual
Languages:
English Mandarin Chinese Spanish italian
Availability:
From Owner
License:
N.A.
Size:
1120 concepts Production Status:
Existing-used
Use:
Word Sense Disambiguation
-
Paper title:From Synsets to Videos: Enriching ItalWordNet Multimodally
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Roberto Bartolini | ILC CNR Pisa | IT | ||
| Author 2 | Valeria Quochi | <Not Specified> | None | Consiglio Nazionale delle Ricerche, Istituto di Linguistica Computazionale "A. Zampolli" | IT |
| Author 3 | Irene De Felice | ILC CNR Pisa | IT | ||
| Author 4 | Irene Russo | ILC CNR | IT | ||
| Author 5 | Monica Monachini | Institute of Computational Linguistics - CNR | IT | ||
| Main Contact | Irene Russo | ILC CNR | None |
Documentation:
Documentation is publicly available in Italian and English from the project website
Written
Corpus,
Language Type:
Multilingual
Languages:
English Mandarin Chinese Spanish french
Availability:
From Owner
License:
None
Size:
46 GByte Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:The United Nations Parallel Corpus v1.0
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Michał Ziemski | United Nations | US |
| Author 2 | Marcin Junczys-Dowmunt | Adam Mickiewicz University, Poznań | PL |
| Author 3 | Bruno Pouliquen | World Intellectual Property Organization | CH |
| Main Contact | Marcin Junczys-Dowmunt | Adam Mickiewicz University, Poznań | None |
Documentation:
http://conferences.unite.un.org/UNCorpus
Written
Corpus,
Language Type:
Multilingual
Languages:
English Indonesian Japanese Mandarin Chinese
Availability:
Freely Available
License:
CreativeCommons Attribution (CC BY)
Size:
7093 sentences Production Status:
Newly created-in progress
Use:
Language Modelling
-
Paper title:Building The Sense-Tagged Multilingual Parallel Corpus
-
Paper track:Infrastructural Issues/Large Projects
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Shan Wang | Nanyang Technological University | MO |
| Author 2 | Francis Bond | Nanyang Technological University | SG |
| Main Contact | Shan Wang | University of Macau | None |
Documentation:
Yes
Multimodal/Multimedia
Ontology,
Language Type:
Multilingual
Languages:
English Hindi Mandarin Chinese Spanish italian
Availability:
Freely Available
License:
Creative common license
Size:
<Not Specified> Production Status:
Newly created-in progress
Use:
<Not Specified>
-
Paper title:From Visual Prototypes of Action to Metaphors: Extending the IMAGACT Ontology of Action to Secondary Meanings
-
Paper track:<Not Specified>
-
Paper status:Accept
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Susan Windisch Brown | University of Colorado at Boulder | US | University of Florence | US |
| Main Contact | Susan Windisch Brown | University of Colorado at Boulder | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
From Owner
License:
Free for academic research
Size:
10 hours Production Status:
Newly created-finished
Use:
Language learning
-
Paper title:Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese
-
Paper track:Speech
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Antonio Moreno-Sandoval | Universidad Autónoma de Madrid | None | ||
| Author 2 | Leonardo Campillos | Universidad Autónoma de Madrid | None | UAM | ES |
| Author 3 | Yang Dong | Universidad Autónoma de Madrid | None | ||
| Author 4 | Emi Takamori | Universidad Autónoma de Madrid | None | ||
| Author 5 | José M. Guirao | Universidad de Granada | None | ||
| Author 6 | Paula Gozalo | Universidad Autónoma de Madrid | None | ||
| Author 7 | Chieko Kimura | Universidad Autónoma de Madrid | None | ||
| Author 8 | Kengo Matsui | Tokyo University of Foreign Studies | None | ||
| Author 9 | Marta Garrote | Universidad Autónoma de Madrid | None | ||
| Main Contact | Antonio Moreno-Sandoval | Autonomous University of Madrid | ES |
Documentation:
Yang Dong's Ph.D. thesis, in SpanishLanguage Type:
Multilingual
Languages:
English Mandarin Chinese
Availability:
From Data Center(s)
License:
LDC
Size:
100000 words Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
-
Paper title:Parallel Chinese-English Entities, Relations and Events Corpora
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Justin Mott | Linguistic Data Consortium | US | ||
| Author 2 | Ann Bies | Linguistic Data Consortium, University of Pennsylvania | US | Linguistic Data Consortium | US |
| Author 3 | Zhiyi Song | Linguistic Data Consortium | US | ||
| Author 4 | Stephanie Strassel | Linguistic Data Consortium, University of Pennsylvania | US | ||
| Main Contact | Justin Mott | Linguistic Data Consortium | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>Production Status:
Existing-used
Use:
Discourse
-
Paper title:Fine-Grained Chinese Discourse Relation Labelling
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Huan-Yuan Chen | National Taiwan University | TW |
| Author 2 | Wan-Shan Liao | National Taiwan University | TW |
| Author 3 | Hen-Hsen Huang | National Taiwan University | TW |
| Author 4 | Hsin-Hsi Chen | National Taiwan University | TW |
| Main Contact | Hsin-Hsi Chen | National Taiwan University | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
From Owner
License:
<Not Specified>
Size:
45000000 words Production Status:
Newly created-finished
Use:
Parsing and Tagging
-
Paper title:A Dependency Treebank of the Chinese Buddhist Canon
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Tak-sum Wong | City University of Hong Kong | HK |
| Author 2 | John Lee | City University of Hong Kong | HK |
| Main Contact | John Lee | City University of Hong Kong | None |
Documentation:
<Not Specified>
Written
Lexicon,
Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
Freely Available
License:
<Not Specified>
Size:
27221 lexemes Production Status:
Newly created-finished
Use:
Lexicon Creation/Annotation
-
Paper title:ANTUSD: A Large Chinese Sentiment Dictionary
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Shih-Ming Wang | Institute of Information Science - Academia Sinica | TW |
| Author 2 | Lun-Wei Ku | Academia Sinica | TW |
| Main Contact | Shih-Ming Wang | Institute of Information Science - Academia Sinica | None |
Documentation:
<Not Specified>




